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Information content in the halo-model dark-matter power 
spectrum II: Multiple cosmological parameters 
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ABSTRACT 

We investigate the cosmological Fisher information in the non-linear dark-matter 
power spectrum in the context of the halo model. We find that there is a plateau 
in information content on translinear scales which is generic to all cosmological pa- 
rameters we tried. There is a rise in information on smaller scales, but we find that 
it is quite degenerate among different cosmological parameters (except, perhaps, the 
tilt). This suggests that it could be difficult to constrain cosmological parameters us- 
ing the non-linear regime of the dark-matter power spectrum. We suggest ways to get 
around this problem, such as removing the largest haloes from consideration in survey 
analysis. 
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1 INTRODUCTION 

The distribution of matter in the Universe on large scales 
contains a wealth of cosmological information. Even though 
galaxy redshift surveys provide a huge amount of data about 
galaxy clustering on non-linear scales, it is unclear how use- 
ful these smaller scales are cosmologically. Using iV-body 
I simulations, Rimes & Hamilton (2005, RH05; 2006, RH06) 
investigated the amount of cosmological information, as 
a function of scale, in the matter power spectrum P(k). 
They found that information about In A, the initial am- 
plitude of the linear power spectrum, is preserved in P(k) 
on large, linear scales, and that there is significant infor- 
mation on small scales, but there is little independent in- 
formation on translinear scales ( k ~ 0.2 — 0.8 h^ 1 Mpc). 
iNevrinck. Szapudi fc Rimes! (|2006l . Paper I) found that the 
halo model also predicts this translinear plateau in the infor- 
mation about In A in P(k). Paper I showed that the translin- 
ear plateau came largely from cosmic variance in the number 
of the largest haloes in a given volume. 

In this Letter, we extend the analysis of Paper I, look- 
ing at how well P(k) can constrain multiple cosmological 
parameters simultaneously. 



2 METHOD 

The Fisher information matrix F a p abou t parameters 
a and /3 given a set of data is defined (|Fisherl 1 19351 ; 
iTegmark. Taylor fc Heavens! [l997l as the concavity of the 
natural logarithm of the likelihood function C, averaged over 
an ensemble of data predicted by C; 



F a f3 



d 2 ln£(data|a, (3, priors) \ 



da d/3 



1 



(1) 



In this Letter, we investigate the (hereafter, implicitly 
non-linear, dark-matter) power spectrum Pi — P(ki) (actu- 
ally, In Pi), measured in bins of wavenumber fe. The cumu- 
lative Fisher information over a range of bin indices i 6 Tt 
is 



dlnPj d 2 \n£ dlnP, 
da din Ptd In Pj d/3 



(2) 



To simplify this, we approximate the expectation value of 
the data Fisher matrix as C _1 , the inverse of the data covari- 
ance matrix CV, = (A In Pi A In Pj). This approximation is 
good if estimates of In Pi have Gaussian distributions about 
their expectation values. This seems to be adequately so for 
measurements of the dark- matter power spectrum (RH06), 
as the central limit theorem would encourage one to think. 
We denote the data covariance matrix as C (with Fisher 
matrix C" 1 ), and denote the parameter Fisher matrix as F 
(with covariance matrix F _1 ). Equation (J2J) becomes 



F af3 (1Z) = 



£ 



dlnPi ! dhxPj 



da 



dii 



(3) 



where C-r. is the square submatrix of C with both indices 
ranging over 7?.. 

The definition in Eqn. <(3j is equivalent to the one used 
in RH06, except that it bypasses the step of explicit upper- 
Cholesky decorrelation. However, Eqn. © differs from the 
erroneous definition used in RH05 and Paper I, for which 
derivative terms were used only on the diagonal. This pre- 
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Figure 1. Different definitions of cumulative information, 
Fin A about In A in the dark-matter power spectrum 

Ca(^ fe) is the notation used in Paper I.) The red, dashed curve 
uses the incorrect, 'old' information definition, used in Paper I and 
RH05, while the black curve uses the correct definition, used in 
this Letter, and RH06. The dotted curves show what RH06 found 
from TV-body simulations of box size 128 and 256 h~ 1 Mpc, using 
the 'new' definition. 



vious definition can be written 

Ed In Pi c 
da 



In 



00 



E 



(C<i-l)im 



(4) 



where C^i is the upper-left square submatrix of C with 
indices only through i. Figure [T] shows the difference this 
makes in the cumulative information in In A, using the same 
cosmology as in Paper I. The difference is small, but could 
be larger for a parameter with derivative terms farther from 
unity than In A. We also show the measurements from sim- 
ulations (RH06), calculated using Eqn. 

2.1 Covariance matrix construction 

We use the same procedure for the matter power spectrum 
covariance matrix as in Paper I. The covariance of the power 
spectrum in a survey of volume V is the sum of a Gaussian 
term, which depends on the square of the power spectrum 
itself, and a term i nvolving the (hereafter, implicitly non- 
linear) trispectrum (lHamilton. Rimes fc Scoccimarrol I 2006; 
IScoccimarro. Zaldarriaga fc Huilll999l . SZH); 



(2tt) 

Vs.i 



-2P(fo)% + T i3 



(•>) 



where V s ,i is the volume of shell i in Fourier space (propor- 
tional to ki for logarithmically spaced bins), and Tij is the 
trispectrum averaged over shells i and j; 

T(fc i ,-fc i ,fc J ,-fc i )^i^*i. (6) 

Vs,i Vs,j 



T, 



W e use the halo-model formalism l|Coorav fc Shethl 
2002) to get the non-linear matter power spectrum and 



trispectrum. In the halo model, the universe is assumed to 
consist of virialized haloes distributed according to leading- 
order perturbation theory (the first-order, linear, power 
spectrum, the second-order bispectrum, and the third-order 
trispectrum). In the halo model, the power spectrum is the 
sum of one- and two-halo terms, and the trisp ectrum is the 
sum of one-, two-, t hree-, and four-halo terms (|Coorav||200ll ; 
ICoorav fc Hull200ll . CH). For example, the power spectrum 
is 

P(k) = P lh (fc) + P 2h (k) (7) 
= M? 2 (k,k) + P Un (k)[Ml(k)] 2 , (8) 

where are integrals over the halo mass function; 



M£(*i,...,* M ); 



b/3(m)n(m, c) 



xu(ki,m, c) ■ ■ ■ i^fc^, m, c) dc dm. 



(9) 



Here, p is the mean matter density, m is the halo 
mass, c is the halo concentration in the NFW profile 



llNavarro. Frenk fc White! Il996l). bsjm) is the /3-order halo 
bias l|Mo. Jing fc Whitejll997l ; IScoccimarro et al.ll200ll ). and 
u(k, m, c) is the halo profile in Fourier space, normalized to 
unity at k = 0. In Eqn. ©, we assume that the power spec- 
trum P hh of a set of haloes is a uniformly biased linear power 
spectrum P lm , even though this se ems not to be quite true 
l|Smith. Scoccimarro fc Shethll2006l . SSS). 

For the covariance matrix, we use use the same 
halo-model inpu t s as did CH, except that we use a 
ISheth fc Tormenl {l999) halo mass function. The baseline 
cosmology is a flat concordance mo del, specifically tha t 
found recently by the WMAP team l|Spergel et al.l l2006l ). 
except that n = 1, i.e. {Q, m h 2 = 0.127, Q b h 2 = 0.0223, n = 
1, as = 0.74). For the input linear power spectrum, we use 
the cam^B code. 



2.2 Parameter derivatives 

The other major ingredients in our analysis are the deriva- 
tives of In Pi with respect to parameters of interest. In 
Paper I, we varied \nA by small amounts, and thus cal- 
culated d\n Pi/ din A numerically. In this Letter, we do 
the same with the parameters (ln^l, n, \nh, \nff,, \nQ m h 2 ). 
Here, A and n are the scalar amplitude and tilt of 
the power spectrum, using the default CAMB pivot point, 
at 0.05 Mpc -1 . The parametrized Hubble constant h = 
-ffo/(100km sec -1 Mpc" 1 ), and fb is the baryon fraction 
Qb/Qm, where fib is the baryon density, and £l m is the sum 
of fib and the dark-matter density, fl c . We assume a flat 
Universe throughout. 

We tried two different methods to calculate the param- 
eter derivatives: using our halo model code; and, using the 
ha lofit (impl e mente d in CAMB) fitting formula developed 
bv lSmith etal] (120031 ). Each method has its advantages: us- 
ing our code would be self-consistent, but halofit has been 
extensively tested against iV-body simulations. We expected 
the results to be similar, though, since halofit was devel- 
oped in the spirit of halo models, having both quasi-linear 
and self-halo terms. 



1 See http: //camb . info/ 
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Figure 2. Parameter derivatives 9 In P(k) j da (where a is the 
parameter shown on the left), computed using two methods : our 
halo model code (black curves) and the ISmith et al.l ||2003|) fit- 
ting function (green curves). Where the curves are dashed, the 
derivatives are negative. The red dotted 8 In P(k)/d In A curve 
was measured from 400 iV-body simulations by RH05. 



Figure [2] is a comparison of the derivative terms from 
the two methods. While they give similar results on lin- 
ear scales, there are qualitative differences on non-linear 
scales. In general, halofit predicts greater derivative terms 
than our code. We also show din P(k)/d\n A as measured 
from 400 128-particle PM TV-body simulations of box size 
256 ZiMpc -1 run by RH05 (using a slightly different cos- 
mology). Although both methods are roughly consistent 
with the simulation measurement, the continued decrease on 
smaller scales which occurs in our code is somewhat more 
plausible than the upturn seen with halofit. The methods 
differ most dramatically for In h; our halo model code pre- 
dicts a tiny variation with In h on small scales, while halofit 
predicts a variation comparable to other cosmological pa- 
rameters. 

The two models give different predictions on small 
scales because of different ways the one-halo terms P lh 
(dominant on small scales) are defined. In the halo model, 
P depends only on the abundances and concentrations of 
haloes of different masses; these depend on the variance in 
the linear density field smoothed with a top-hat filter of 
radius r, cr 2 (r). Keeping all of our other parameters fixed, 
changing h merely shifts lnP lm (fc) horizontally and verti- 
cally. For this cosmological model, the shift is such that 
P hn (k) does not change if the local slope dlnP lln (fc)/dlnfc 
reaches a value ~ —2. This happens at small scales, so <r 2 (r) 
does not change appreciably for small r. Thus, the abun- 
dance and concentration of small haloes varies only slightly 
with ln/t, and P lh hardly changes on small scales. On the 
other hand, the halofit power spectrum does vary with 
In ft on small scales, since it explicitly depends on fi m . (In 
both cases, we hold Q. m h 2 constant.) 



3 RESULTS 

The parameter Fisher matrix F a p gives predictions of sta- 
tistical error bars and error ellipsoids, assuming that the 
likelihood functions are Gaussian. This assumption does not 
precisely hold, but is adequate to look for trends. Holding 
all other parameters fixed, the variance in a parameter a is 
1/F aa , while the variance marginalized over other param- 
eters is (F~ 1 ) aa , where F is the Fisher matrix including a 
and the other parameters. 

Figure [3^ shows how constraints on our chosen pa- 
rameters change with the smallest scale (largest wavenum- 
ber) used in the analysis. We use a survey volume of 
256 ft. -1 Mpc, and a fixed lowest wavenumber, 0.02 ftMpc -1 
« 27r/(256 h _1 Mpc). In Fig. EJ instead of the rather ab- 
stract quantity of information, we show probably more fa- 
miliar error-bar half- widths. 

Diagonal plots show error-bar half- widths, both un- 
marginalized, 1/V F aa (black), and marginalized over all 
four other parameters, W (F~ 1 ) aa (green). The solid and 
dashed curves use the halo-model and halofit derivative 
terms, respectively. For example, the information about In A 
as plotted in Fig. Q] appears (to the —1/2 power) in the solid 
black curve in the upper-left plot. 

Off-diagonal plots show correlation coefficients R a p = 
(F in the marginalized parame- 

ter covariance matrix F containing all five parameters. If 
R a f3 ~ ±1, then an error ellipse is squashed along a diagonal 
line; if R a p = 0, then the ellipse is circular. 

The main conclusions of this Letter come from Fig. [3^,. 
The cumulative information in a parameter varied alone gen- 
erally has the characteristics found in RH05 and Paper I: 
there is a plateau on translinear scales, followed by a rise on 
fully non- linear scales. This rise appears as a drop in Fig.[3K 
which shows the information to the power —1/2. Unfortu- 
nately, this small-scale error-bar tightening is quite degen- 
erate among parameters. The green curves on the diagonal 
display this clearly; with the possible exception of the tilt, 
marginalized error-bar half- widths level off in the translinear 
regime, and never significantly decrease as smaller scales are 
included in the analysis. This degeneracy is also visible in 
the correlation coefficients, many of which are nearly ±1 on 
small scales. Small scales provide an increased lever arm for 
constraining the tilt on small scales, but the marginalized 
error bars in the tilt only tighten significantly if using our 
code's derivative terms, not those from halofit. 

Why is the small-scale rise in information so degenerate 
among parameters? As discussed above, P lh is entirely de- 
termined by the abundances and concentrations of haloes, 
which depend on integrals (over top-hat window functions) 
of the linear power spectrum. Altering any single parame- 
ter will indeed change these integrals, but this change will 
generally be close to a monolithic shift up or down in P lh . 
Thus, changing any parameter will have a similar effect on 
the small-scale power spectrum. 

Our previous plots have shown the cumulative infor- 
mation up to a wavenumber fc max , holding fc m i n and the 
volume fixed. In Fig. |3]d, we crudely explore the effects of 
survey size. We show unmarginalized error-bar half-widths, 
computed using derivatives from our code (not halofit), 
changing the volume of the survey, but not the range of 
scales measured. For a box size b, we assume that P(k) can 
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Figure 3. (A, Upper-right). One-sigma error-bar half-widths on various cosmological parameters as a function of the highest 
wavelength considered, fcmax • The lowest wavelength is held constant at 0.02 ftMpc" , and a volume of (256 h~ 1 Mpc) 3 is used. Plots 
on the diagonal show error-bar half- widths in single parameters, both unmarginalized (black) and marginalized over all four other 
parameters (green). For the solid curves, we use our halo model code for parameter derivatives; for the dashed, we use HALOFIT. 
Off-diagonal plots show correlation coefficients between pairs of parameters, in the marginalized covariance matrix F _1 . 
(B, Lower-left). A crude exploration of practical issues of survey size. We show unmarginalized one-sigma error-bar half- widths in 
various cosmological parameters, holding the dynamic range of scales used constant, at a factor of 10, and using parameter derivatives 
from our halo model code. The volume of the box changes with fcmax = 10fc m in; we imagine measuring the power spectrum in a box of 
volume (27r/fc m i n ) 3 . 



be measured from fc m in = 2n/b to fcmax = lOfcmin. Here, the 
translinear plateau takes the form of a steep ramp upward 
at fcmax ~ 1 ftMpc" . 

3.1 Looking in rural areas of the Universe 

In Paper I, we argued that the translinear information 
plateau in P(k) is caused by cosmic variance in the number 
of the largest haloes in a given volume, since on translinear 
scales, large haloes dominate P(k). A potential way around 
this problem is to model the contribution of the largest 
haloes to the power spectrum. Another way could be to 
remove the 'noise' of the largest haloes from the analysis. 
Even beyond the cosmic- variance argument, it is plausible 
that cosmological information is especially obscured in large 
haloes, in advanced stages of non-linear collapse. This is 
not a new idea; for example, much cosmological information 



seems to lie in the low-overdensity Lyman-a lpha forest (e.g. 
ICroft et al.ll2002l ; iGnedin fe Hamilton! |2002l ). 

To investigate the potential of cutting out large haloes 
from the analysis, we truncate the halo-model integrals over 
the halo mass function at various upper mass cut-offs. For 
this calculation, we use only the dominant lh and 2h terms 
in the trispectrum; we explain why in the rest of the para- 
graph. In this Letter, we assume that the P hh and T hh are 
given by leading-order-perturbation-theory (LOPT). This is 
not quite the case (SSS), but more accurate estimates of P hh 
and T hh are not currently known. The LOPT power spec- 
trum P lin is first-order, but the LOPT trispectrum T° (3) 
is third-order; as SZH point out, using both of them to- 
gether is inconsistent. Indeed, we find that doing so in Eq. 
[S] gives off-diagonal entries exceeding unity in the correla- 
tion matrix, violating the Schwarz inequality. These unruly 
entries are in the translinear regime, where T '- 3 ' may still 
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Figure 4. The effect of introducing a mass cut-off in the halo 
mass function on error-bar half-widths on In A. Again, V = 
(256 ftMpc -1 ) 3 . The information in the power spectrum of mat- 
ter in low-mass haloes is potentially greater than in the full mat- 
ter distribution. For the solid curves, only the dominant one- and 
two-halo terms of the halo-model trispectrum are included; for 
the dashed curve, the full trispectrum is used, including the three- 
and four-halo terms. 



accurately trace the non-linear trispectrum, but P lm does 
not trace the non-linear power spectrum. Using the full halo 
mass spectrum in the halo model, this inaccuracy does not 
matter substantially, since where P lin and T° (3) may not 
accurately trace P hh and T hh (in the non- linear regime), 
the terms in the trispectrum which involve them are buried 
under other terms. So, we use all terms in the halo-model 
trispectrum when using the full halo mass function. On the 
other hand, if large haloes are missing from the mass spec- 
trum, the raw P hh and T hh are exposed on translinear scales, 
and the inconsistency of using P lln and together mat- 

ters. One way to keep the order of perturbation theory con- 
sistent would be to calculate all functions to third order, a 
laborious task that might not give a more accurate result. 
Instead, we assume, as before, that P hh = P lh \ but exclude 
terms involving functions of higher than first order in the 
halo- model trispectrum (the 3h and 4h terms). 

Figure [4] shows that cutting out the largest haloes from 
the mass function for all quantities indeed shifts the informa- 
tion plateau to smaller scales, giving tighter error bars. We 
also show the results if all terms are included in the trispec- 
trum and no mass cut-off is made; this shows that the lh 
and 2h terms do dominate the halo-model trispectrum in 
this case. 

There are many practical problems which would make 
it difficult to constrain cosmological parameters by remov- 
ing large haloes from the analysis. Halo masses are difficult 
to measure, and measuring all halo masses in a survey seems 
nearly impossible. But perhaps something as simple as the 
number of galaxies in a halo is well-enough correlated with 
halo mass to make a dent in the translinear plateau. Other 
problems could include inadequate knowledge of halo power 
spectra on non-linear scales, and the effects on a survey mask 
from excising haloes. Still, the information gains with mass 



cut-offs are dramatic enough that it seems to be worth ex- 
ploring how to get around these issues. 



4 CONCLUSION 

In the context of the halo model, we find that the mat- 
ter power spectrum is rather disappointing for cosmologi- 
cal parameter estimation on scales smaller than linear. On 
translinear scales (k ~ 1 — 10 /iMpc" 1 ), there is a high 
degree of intrinsic (co)variance in the matter power spec- 
trum caused by cosmic-variance fluctuations in the number 
of large haloes, which suppress the information in any pa- 
rameter of current interest on those scales. There is informa- 
tion on even smaller scales if each parameter is varied alone, 
but we find that this information is quite degenerate among 
various cosmological parameters. This is because changing 
any parameter affects the small-scale matter power spec- 
trum in a similar way, close to a uniform shift up or down 
in the one-halo term. There could be more independent in- 
formation in the tilt on small scales, but one method we 
used (involving halofit) to calculate the non-linear power 
spectrum predicts that the information in the tilt is also 
somewhat degenerate. 

Our results suggest that useful cosmological informa- 
tion is scant below linear scales in the full matter power 
spectrum, but this is probably not the case for all large-scale 
structure statistics. For example, we show that in the halo 
model, the power spectrum of matter outside of large haloes 
has an information plateau on smaller scales than does the 
full matter power spectrum, allowing tighter constraints on 
cosmological parameters. There are practical problems with 
this specific approach, but it raises hopes that there are ways 
to circumvent the covariance and degeneracies among cos- 
mological parameters which haloes introduce on non-linear 
scales. In addition, the smallest scales of the galaxy power 
spectrum contain information about galaxy formation. De- 
spite the apparent difficulties we have found, we remain 
confident that worthwhile information exists on non-linear 
scales in the spatial distribution of certain sets of galaxies, 
matter, or even haloes themselves. 
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